A Visual Interactive Framework for Attribute Discretization

نویسندگان

  • Ramesh Subramonian
  • Ramana Venkata
  • Joyce Chen
چکیده

Discretization is the process of dividing a continuousvalued base attribute into discrete intervals, which highlight distinct patterns in the behavior of a related goal attribute. In this paper, we present an integrated visual framework in which several discretization strategies can be experimented with, and which visually assists the user in intuitively determining the appropriate number and locations of intervals. In addition to featuring methods based on minimizing classification error or entropy, we introduce (i) an optimal algorithm that minimizes the approximation introdnced lp discretimtinn ad (ii\ a nnvd al!mrithm --L---2-_‘-.>--A-_ \.., 1 -_.-_ -~o-~‘L-that uses an unsupervised learning technique, clustering, to identify intervals. We also extend discretization to work with continuous-valued goal attributes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

COCOVILA - Compiler-Compiler for Visual Languages

A compiler-compiler for visual languages is presented. It has been designed as a framework for building visual programming environments that translate schemas into textual representation as well as into programs representing the deep meaning of schemas. The deep semantics is implemented by applying attribute grammars to schema languages; attribute dependencies are implemented as methods of Java...

متن کامل

A New Hybrid Framework for Filter based Feature Selection using Information Gain and Symmetric Uncertainty (TECHNICAL NOTE)

Feature selection is a pre-processing technique used for eliminating the irrelevant and redundant features which results in enhancing the performance of the classifiers. When a dataset contains more irrelevant and redundant features, it fails to increase the accuracy and also reduces the performance of the classifiers. To avoid them, this paper presents a new hybrid feature selection method usi...

متن کامل

Interactive Focus+Context Analysis of Large, Time-Dependent Flow Simulation Data

Visualization of time-dependent simulation data, such as datasets from CFD simulation, still is a very challenging task. In this paper, we present a new approach to the interactive visual analysis of flow simulation data which is especially targeted at the analysis of time-dependent data. It supports the flexible specification and visualization of flow features in an interactive setup of multip...

متن کامل

A Framework for Optimal Attribute Evaluation and Selection in Hesitant Fuzzy Environment Based on Enhanced Ordered Weighted Entropy Approach for Medical Dataset

Background: In this paper, a generic hesitant fuzzy set (HFS) model for clustering various ECG beats according to weights of attributes is proposed. A comprehensive review of the electrocardiogram signal classification and segmentation methodologies indicates that algorithms which are able to effectively handle the nonstationary and uncertainty of the signals should be used for ECG analysis. Ex...

متن کامل

A Multi-attribute Reverse Auction Framework Under Uncertainty to the Procurement of Relief Items

One of the main activities of humanitarian logistics is to provide relief items for survivors in case of a disaster. To facilitate the procurement operation, this paper proposes a bidding framework for supplier selection and optimal allocation of relief items. The proposed auction process is divided into the announcement construction, bid construction and bid evaluation phases. In the announcem...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997